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Search History Transcript 



Page 1 of 1 



WEST Search History 

■ Hide Items | Rest<»e | . G<ear | Cancel: | 

DATE: Monday, May 09, 2005 



Hide? Set Name Query Hit Count 

DB=PGPB. USPT,EPAB; PLUR=YES; OP=ADJ 



□ 


L24 


20050074400.pn. 


1 


□ 


L23 


L22andll7 


16 


□ 


L22 


13 .ti. 


30 


□ 


L21 


20030124128.pn. 


1 


□ 


L20 


20030087250.pn. 


1 


□ 


L19 


L18not@ay>2002 


72 


□ 


L18 


L17andL16 


123 


□ 


L17 


cancerS or neoplas$ or tumor$ 


162015 


□ 


L16 


L15orL14 


320 


□ 


L15 


LS.clm. 


251 


□ 


L14 


L5.ab. 


129 


□ 


L13 


L3.ab. 


64 


□ 


L12 


Lll andL6 


0 


□ 


Lll 


L5 and LIO 


894 


□ 


LIO 


L9orL8orL7 


59922 


□ 


L9 


(536/23.1)![CCLS] 


11747 


□ 


L8 


(530/350)![CCLS] 


16729 


□ 


L7 


(435/4 1435/6 1435/7.1 1435/7.23 |435/7.92 |43 5/7.94)! [CCLS] 


42303 


□ 


L6 


L5 and L2 


4 


□ 


L5 


L4orL3 


3142 


□ 


L4 


OPN$ 


1427 


□ 


L3 


osteopontin 


1850 


□ 


L2 


weber.in. 


14421 


□ 


LI 


wber.in. 


0 



END OF SEARCH HISTORY 



http://westbrs:9000^in/cgi-bin/srchhist.pl?state=h6kh9p.26.1&f=fFsearch&userid=bfetterolf 



5/9/05 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: May 4, 2005, 19:25:36 ; Search time 3779 Seconds 

(without alignments) 
11039.942 Million cell updates/sec 

Title: US-10-67 8-355-5 

Perfect score: 861 

Sequence: 1 atgagaattgcagtgatttg gtgcatcttctgaggtcaat 861 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 4708233 seqs, 24227607955 residues 

Total number of hits satisfying chosen parameters: 9416466 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : GenEmbl:* 



1 




gb ba : * 


2 




gb htg:* 


3 




gb in:* 


4 




gb om: * 


5 




gb ov:* 


6 




gb__pat : * 


7 




gb_ph : * 


8 




gb pi : * 


9 




gb pr:* 


10: 


gb ro : * 


11: 


gb sts:* 


12: 


gb sy:* 


13: 


gb un : * 


14: 


gb_vi : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 861 100.0 861 9 HUMU105MGC 

2 857.8 99.6 864 6 AX040444 



D28761 Homo sapien 
AX040444 Sequence 



3 


854.6 


99. 


3 


1572 


.9 


HSM808149 


BX648003 Homo sapi 


4 


778.4 


90. 


4 


900 


9 


HUMU105MGB 


D28760 Homo sapien 


5 


772 


89 . 


7 


903 


6 


AX040442 


AX040442 Seauence 


6 


772 


89. 


7 


903 


12 


BT007483 


BT0074 83 Svnthetic 


7 


772 


89. 


7 


1447 


6 


C0861588 


C0861588 Seauence 


8 


772 


89 . 


7 


1447 


9 


HUMOSTRO 


lT04 7 6S Human o*5't"PO 


9 


772 


89 . 


7 


1469 


6 


AX375585 


AX375585 Seauence 


10 


772 


89. 


7 


1486 


9 


BC007016 


BC007016 Homo saoi 


11 


772 


89. 


7 


1520 


6 


CQ727525 


C0727525 Seauence 


12 


772 


.89. 


7 


1524 


6 


CQ833972 


C0833972 Seauence 


13 


772 


89. 


7 


1524 


6 


AX663645 


AX663645 Sequence 


14 ' 


772 


89 . 


7 


1524 


9 


AF052124 


AF052124 Homo saoi 

v ^ ^ A XXV^XLi^y 1^ bX k,^ 


15 


772 


89 . 


7 


1549 


9 


BC022844 


BC0228 4 4 Homo sani 


16 


771 


89. 


5 


942 


9 


HUMU105MGA 


D28759 Homo sanien 

Xi/£>w r «r XXWXILw i3Clk>JLCXA 


17 


768 . 8 


89. 


3 


1493 


6 


AR364591 


AR364591 Seauence 


18 


766.2 


89. 


0 


945 


6 


AX040440 


AX040440 Sequence 


19 


766. 2 


89 . 


0 


1422 


6 


AR234558 


AR234558 Seauence 


20 


766.2 


89 . 


0 


1424 


6 


BD234065 


BD234065 Method fo 


21 


766.2 


89. 


0 


1424 


6 


AR308010 


AR308010 Seauence 


22 


766.2 


89. 


0 


1424 


6 


AX702085 


AX702085 Seauence 


23 


766. 2 


89 . 


0 


1424 


9 


HSOP 


XI 3 694 Human mRNA 

« ^ «L ^ \y ^ ^ X X VXX I iW4 XX X L LX XL ^ 


24 


766.2 


89 . 


0 


1476 


9 


AK057738 


AK057738 Homo sapi 


25 


766.2 


89 . 


0 


1499 


9 


BC017387 


BC017387 Homo saoi 

X^ Vm' v ^ f \^ f XX^^Xil^> wCXKiZ-iL 


26 


766.2 


89. 


0 


2254 


6 


CQ412697 


CQ412697 Sequence 


27 


764 . 6 


88 . 


8 


1278 


9 


HUMNEPPON 


M83248 Homo sapien 


28 


764 . 6 


88 . 


8 


1712 


6 


BD134496 


BD134496 Human nuc 


29 


764 . 6 


88 . 


8 


1712 


6 


BD203724 


BD2 03724 Human nuc 

imf \J -^mJ r £^ ^ XX ^X 1 ICX XX XX \Jk 


30 


764 . 6 


88 . 


8 • 


1712 


6 


AX014879 


AX014879 Sequence 


31 


764 . 6 


88 . 


8 


1712 


6 


AX017422 


AX017422 Sequence 


32 


764 . 6 


88. 


8 


1712 


6 


AX525126 


AX525126 Sequence 


33 


763 


88 . 


6 


1505 


6 


CQ414711 


CQ414711 Sequence 


34 


737. 4 


85. 


6 


1482 


6 


CQ413850 


CQ413850 Sequence 


35 


700.2 


81 . 


3 


1096 


6 


AX392707 


AX392707 Sequence 


36 


671.2 


78. 


0 


1410 


9 


AK075463 


AK075463 Homo sapi 


37 


547 


63. 


5 


800 


6 


BD072439 


BD072439 Secreted 


38 


448.8 


52. 


1 


549 


6 


CQ704892 


CQ704892 Sequence 


39 


448 .6 


52. 


1 


1437 


4 


RABOPN 


D16544 Oryctolagus 


40 


444.6 


51. 


6 


1404 


4 


RABOSP 


D11411 Oryctolagus 


41 


437.6 


50. 


8 


552 


6 


AX082193 


' AX082193 Sequence 


42 


430.2 


50. 


0 


465 


6 


CQ698180 


CQ698180 Sequence 


43 


422.4 


49. 


1 


1430 


4 


SSSPPl 


X16575 Porcine Spp 


44 


421.8 


49. 


0 


440 


6 


CQ703436 


CQ703436 Sequence 


45 


403.4 


46. 


9 


10881 


9 


HUMOSTP 


D14813 Homo sapien 



ALIGNMENTS 



RESULT 1 
HUMU105MGC 
LOCUS 
2003 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 



HUMU105MGC 

Homo sapiens 
D28761 

D28761.1 GI 



861 bp mRNA 
mRNA for OPN-c, partial cds . 
: 633076 



linear PRI 08-FEB- 



SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 861) 

Saitoh, Y., Kuratsu^J., Takeshima, H . , Yamamoto,S. and Ushio,Y. 
Expression of osteopontin in human glioma. Its correlation with 



REFERENCE 

AUTHORS 
TITLE 
the 



JOURNAL 

MEDLINE 
PUBMED 
REFERENCE 

AUTHORS 

TITLE 

JOURNAL 
Medical 

Kumamoto 

FEATURES 

source 



malignancy 

Lab. Invest. 72 (1), 55-63 (1995) 

95139605 
7837791 

2 (bases 1 to 861) 

Saitoh, Y. 

Direct Submission 

Submitted (28-FEB-1994) Yoshiki Saitoh, Kumamoto University 

School, Department of Neurosurgery; 1-1-1 Honjo, Kumamoto, 

860, Japan (Tel : 81-96-344-2111 (ex. 5684 ) , Fax:81-96-371-8064) 
Location/Qualifiers 
1. .861 

/organism="Homo sapiens" 
/mol_t yp e= "mRNA" 
/db_xref="taxon: 9606" 
/cell_line="U-105MG" 
/cell_type="astrocytoma cell" 
1. . .>861 

/note="alternative splicing variant 
osteopontin" 

/ codon_start=l 
/product="OPN-c" 
/protein_id="BAA05951 . 1" 
/db xref="GI: 992950" 



/translation="MRIAVICFCLLGITCAIPVKQADSGSSEEKQNAVSSEETNDFKQ 
ETLPSKSNESHDHMDDMDDEDDDDHVDSQDSIDSNDSDDVDDTDDSHQSDESHHSDES 
DELVTDFPTDLPATEVFTPWPTVDTYDGRGDSWYGLRSKSKKFRRPDIQYPDATDE 

HITSHMESEELNGAYKAIPVAQDLNAPSDWDSRGKDSYETSQLDDQSAEAHSHKQSRL 

YKRKANDESNEHSDVIDSQELSKVSREFHSHEFHSHEDMLWDPKSKEEDKHLKFRIS 

HELDSASSEVN" 

ORIGIN 



CDS 



0; 



Query Match 100.0%; Score 861; DB 9; Length 861; 

Best Local Similarity 100.0%; Pred. No. 2.5e-227; 

Matches 861; ' Conservative 0; Mismatches 0; Indels 0; 



Gaps 



Qy 

Db 



1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I 
1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 





61 


CAGGCT GATT CT GGAAGTTCTGAGGAAAAGCAGAATGCT GTGT C PT CT G A AG7VAACC AAT 


1 90 






I 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db , 


61 


CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 


120 


Ov 


121 


GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 


180 






1 1 1 1 1 i 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i i 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 i 1 1 1 1 1 1 
1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 


180 


Ov 


181 


AT G GAT GAT GAAGAT GAT GAT GAG CAT G T G GACAG C GAG GACT C CAT T GACT C GAAC GAC 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 11 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1- 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 


240 


Ov 


241 


TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTrTPArrATTrTGAT 


■J V V/ 






1 1 1 1 1 t 1 1 1 1 1 1 1 i M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 


300 


Ov 


301 


GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTrArT 


fin 






1 1 1 1 1 1 1 1 i 1 1 1 1 ! 1 1 1 i 1 1 1 1 1 1 1 1 1 i 1 1 1 i 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 


360 


Ov 


361 


CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGAPTG 


1 ^ v 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 i 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 


420 


Ov 


421 


AGGT CAAAAT CTAAGAAGT TT C GCAGAC CT G AC AT r C A GT A r r r T G A T GP T A P A f^a P A n 


o u 






1 1 1 ] 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i M 1 1 i i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 

1 1 1 I 1 1 1 1 1 1 1 1 1 > 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 


480 


Ov 


481 


CACAT CAC CT CACACAT GGAAAGC GAGGAGTT GAAT GGT GCAT ACAAGGCP AT CCCC GTT 


1 u 






i 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I 1 1 1 1 1 1 M 1 1 1 i i 1 1 1 1 1 1 i 1 

1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


481 


CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 


540 


Qv 


541 


GC C CAG GAC CT GAACGC G C CT T CT GAT T G GGACAGCC GT GGGAAG GACAGTTAT GAAACG 


600 






1 1 1 ( 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 i 1 1 1 i 1 1 1 t 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 




Db 


541 


GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 


600 


Ov 


601 


AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGPAGTPPAGATTATATAAG 


VJ D VJ 






1 1 1 1 1 1 1 i 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 i 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 




Db 


601 


AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 


660 


Ov 


661 


C GGAAAG CTAAT GAT GAGAGCAAT GAGCATT C C GAT GT GATTGAT AGT P AGG A A P TTT P P 


720 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 i 1 1 1 1 i 1 1 1 1 1 1 1 

1 1 > 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


661 


CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 


720 


Ov 


721 


AAAGTCAGCCGTGAATTPPAPAGPPATGAATTTPAPAGPPATGAAGATAT<^PTrjr;TTGTA 








1 1 1 i 1 1 1 1 1 1 1 i 1 1 i i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 M 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 








A A AfZTP ACPPnTTZZi ZiTTPPZiPZiCPPZiTnzi ZiT'TTr'apar'r'r'ZiT'r'Zi a r'ZiT'ZiTr'r''pr'r'T"T'r''pa 


T R n 


Qy 


781 


GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 


840 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


781 


GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 


840 


Qy 


841 


AGTGCATCTTCTGAGGTCAAT 861 








1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 




Db 


841 


AGTGCATCTTCTGAGGTC7\AT 861 





us-10-678-355-5.rng 



GenCore version 5,1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



Searched: 



May 4, 2005, 09:10:38 ; Search time 535 Seconds 

(without alignments) 
9526.911 Million cell updates/sec 

US-10-678-355-5 
861 

1 atgagaattgcagtgatttg gtgcatcttctgaggtcaat 861 

IDENTITY^NUC 

Gapop 10.0 , Gapext 1.0 

4390206 seqs. 2959870667 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



8780412 



Post-processing; 



Database 



: Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

N_Geneseq_16Dec04 : * . 

1: geneseqnl980s:* 

2: geneseqnl990s:* 

3 : geneseqn2000s : * 

4: geneseqn2001as: * 

5 : geneseqn2001bs : * 

6: geneseqn2002as:* 

7: geneseqn2002bs:* 

8: geneseqn2003as:* 

9: geneseqn2003bs:* 

10 : geneseqn2003cs : * 

11: geneseqn2003ds : * 

12 : geneseqn2004as : * 

13 : geneseqn2004bs : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



% 

Result Query 

No. Score Match Length DB 



SUMMARIES 



ID 



Description 



1 


861 


100.0 


861 


6 


ABS76484 


Abs76484 cDNA enco 


2 


861 


100.0 


861 


8 


ACC50249 


ACCS0249 Breast ca 


3 


861 


100.0 


861 


8 


ACF12916 


Acfl2916 Human cer 


4 


861 


100.0 


861 


10 


ADB75482 


Adb75482 Prostate 


5 


861 


100.0 


861 


13 


ADQ88469 


Adq88469 Human OPN 


6 


857.8 


99.6 


864 


4 


AAC62290 


Aac62290 DNA encod 


7 


835.2 


97.0 


959 


13 


ADS10300 


Adsl0300 Human the 


8 


776.8 


90.2 


900 


13 


ADQ88468 


Adq88468 Human OPN 


9 


772 


89.7 


903 


4 


AAC62289 


Aac62289 DNA encod 


10 


772 


89.7 


1447 


3 


AAA88861 


Aaa88861 Human ost 
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11 772 89.7 1447 6 ABT10873 

12 772 89.7 1447 8 ACA64902 

13 772 89.7 1447 10 ADF90545 

14 772 89.7 1447 12 ADP13452 

15 772 89.7 1447 13 ADR52870 

16 772 89.7 1469 6 ABK10276 

17 772 89.7 1493 2 AAX33994 

18 772 89.7 1524 6 ABS76483 

19 772 89.7 1524 6 ABT10162 

20 772 89.7 1524 6 ABQ81546 

21 772 89.7 1524 6 ABV78007 

22 772 89.7 1524 8 ABX76457 

23 772 89.7 1524 8 ACC50248 

24 772 89.7 1524 8 ACF12915 

25 772 89.7 1524 10 ADB75480 

26 772 89.7 1524 10 ADF90546 

27 772 89.7 1524 10 ACC46769 

28 772 89.7 1524 11 ADN39798 

29 772 89.7 1524 11 ADN39189 

30 772 89.7 1524 11 ADP65060 

31 772 89.7 1524 11 ADP65614 

32 772 89.7 1524 12 ADL70219 

33 772 89.7 1524 12 ADN59611 

34 772 89.7 1524 12 ADP13543 

35 772 89.7 1524 12 ADQ29582 

36 772 89.7 1524 13 ADR05365 

37 772 89.7 1524 13 ACN38131 

38 772 89.7 1524 13 ADP24343 

39 772 89.7 1524 13 ADR66549 

40 772 89.7 1524 13 ADR66207 

41 772 89.7 1549 13 ADR44024 

42 772 89.7 1616 12 AD028660 

43 772 89.7 1734 12 ADQ24962 

44 772 89.7 2209 11 ACN92777 

45 771 89.5 942 6 ABS76482 



Abtl0873 Human bre 
Aca64902 Human ost 
Adf90545 Human hep 
Adpl3452 Renal eel 
Adr52870 Drug then 
Abkl0276 Human cDN 
Aax33994 Human ost 
Abs76483 cDNA enco 
Abtl0162 Human bre 
Abq81546 Gene up-r 
Abv78007 Hypoxia-r 
Abx76457 Lung cane 
ACC50248 Breast ca 
Acfl2915 Human cer 
Adb75480 Prostate 
Adf90546 Human hep 
ACC46769 Human COP 
Adn39798 Cancer/an 
Adn39189 Cancer/an 
Adp65060 Human sec 
Adp65614 Human do 
Adl70219 Colon can 
Adn59611 Osteopont 
Adpl3543 Renal eel 
Adq29582 Human col 
Adr05365 Human cDN 
Acn38131 Tumour-as 
Adp24343 PRO polyp 
Adr66549 Human pro 
Adr66207 Human pro 
Adr44024 Human col 
Ado28660 Human ost 
Adq24962 Human sof 
Acn92777 Breast ca 
Abs76482 cDNA enco 



ALIGNMENTS 



RESULT 1 
ABS76484 

ID ABS76484 standard; cDNA; 861 BP. 

XX 

AC ABS76484; 
XX 

DT ll-DEC-2002 (first entry) 
XX 

DE CDNA encoding human ovarian cancer marker OV50. 

XX 

KW Human; ovarian cancer; marker; cancer; familial history; brain disorder; 

KW central nervous system disorder; bacterial meningitis; viral meningitis; 

KW Alzheimer's disease; Parkinson's disease; cerebral oedema; hydrocephalus; 

KW brain herniation; inflammation; encephalitis; testicular disorder; 

KW nontubereulous granulomatous orchitis; connective tissue disorder; 

KW heart disorder; ischaemic heart disease; atherosclerosis; neoplasm; 

KW histological type; carcinogenic; ovarian cancer marker; gene; ss. 
XX 

OS Homo sapiens. 

XX 

PN WO200271928-A2. 
XX 

PD 19-SEP-2002. 
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XX 

PF 14-MAR-2002; 2002WO-US007826. 

XX 

PR 14-MAR-2001; 2001US-0276025P. 

PR 14-MAR-2001; 20O1US-O276026P. 

PR lO-AUG-2001; 2001US-0311732P. 

PR 19-SEP-2001; 2001US-0323580P. 

PR 26-SEP-2001; 2001US-0324967P. 

PR 26-SEP-2001; 2001US-0325102P. 

PR 26-SEP-2001; 2001US-0325149P. 
XX 

PA (MILL-) MILLENNIUM PHARM INC. 
XX 

PI Monahan JE, Gannavarapu M, Hoersch S, Kamatkar S, Kovatis SG; 

PI Meyers RE, Morrisey MP, olandt PJ, sen A, vieby PO, Mills GB; 

PI Bast RC, Lu K, schmandt RE, Zhao X, Glatt K; 

XX 

DR WPI; 2002-723277/78. 

DR P-PSDB; ABG96388. 
XX 

FT Assessing whether a patient is afflicted with ovarian cancer, useful in 

PT assessing the stage or progression of the disease, comprises comparing 

PT the expression level of a cancer marker in a sample from a patient ana 

PT from a non cancer patient. 

XX 

PS Disclosure; Page 357; 481pp; English. 

XX 

CC The present invention relates to a new method for assessing whether a 

cc patient is afflicted with ovarian cancer. The method involves comparing 

CC the expression level of a marker in a patient sample and the normal level 

cc of expression of the marker in a control non-ovarian cancer sample, where 

cc the marker is selected from 363 cancer markers described in the 

cc specification. The method of the invention is useful in diagnosing or 

cc characterising cancer, in detecting the presence of cancer as early as 

cc possible, and the recurrence of ovarian cancer. The method may also be of 

cc particular use with patients having an enhanced risk of developing 

cc ovarian cancer (e.g. patients having a familial history of ovarian 

cc cancer). The cancer markers may be used in the management and treatment 

cc of e.g. brain and central nervous system disorders (e.g. bacterial and 

cc viral meningitis, Alzheimer's disease or Parkinson's disease), brain 

cc disorders (e.g. cerebral oedema, hydrocephalus or brain herniations), 

cc inflammations (e.g. bacterial or viral meningitis or encephalitis), 

CC testicular disorders (e.g. nontuberculous granulomatous orchitis), 

cc connective tissue disorders, or heart disorders (e.g. ischaemic heart 

CC disease or atherosclerosis). The compositions and methods may also be 

cc used in assessing the histological type of neoplasm associated with 

cc ovarian cancer, monitoring the progression of ovarian cancer, determining 

cc whether ovarian cancer has metastasized or is likely to metastasize, 

cc selecting a composition for inhibiting ovarian cancer, assessing the 

cc ovarian carcinogenic potential of a compound, or inhibiting ovarian 

cc cancer or at risk of developing ovarian cancer. The present nucleic acid 

cc sequence encodes one of the ovarian cancer markers described in the 

cc invention 

XX 

SQ sequence 861 BP; 261 A; 193 C; 204 G; 203 T; 0 U; 0 Other; 

Query Match 100.0%; Score 861; DB 6; Length 861; 

Best Local similarity 100.0%; Pred. No. 5.1e-244; 

Matches 861; conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

^ llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 
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Qy 


C 1 

61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


1 O 1 

181 


Db 


181 


Qy 


241 


Db 


At 

241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


1 1 
421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


r >i "i 

541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 
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CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 

llllllllllllllllllllillllllllillllllllllllllllllllllllllllll 
CAGGGTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 

GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

ATGGATGATGAAGATGATGATGACCATGT6GACAGCCAGGACTCCATTGACTCGAACGAC 2 40 
llllllllllllllllllllllllllllllllllllllllllilllllllllllllllll 
ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 2 40 

TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 

MIIIIIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIII 

TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 

GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 360 
llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 360 

CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

lllllllllllllllllllllllllillllllllllllllllllllllllllllllllll 
CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 5 40 

lllllllllllllilllillllllllllllllllllllllllllllllllllllllllll 
CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 5 40 

GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

lllllllllllllllllllllllllllllillllllllillllllllllMIIIIIIIII 

GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 

AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 7 80 

llllllllllllllllllliilllllllllllllllllllllllllllllllllllllll 
AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 

GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 8 40 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllillll 
GACCCOW^GTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGA 840 

AGTGCATCTTCTGAGGTCAAT 861 

IMIIIIIIIIIIIIIIIIII 

AGTGCATCTTCTGAGGTCAAT 861 



RESULT 2 
ACC50249 

ID ACC50249 standard; cDNA; 861 BP. 
XX 

AC ACC50249; 
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XX 

DT 12-DUN-2003 (first entry) 
XX 

DE Breast cancer associated cDNA sequence SEQ ID NO: 339. 

XX 

KW Human; breast cancer; cytostatic; gene therapy; gene; ss. 
XX 

OS Homo sapiens. 
XX 

PN WO2003004989-A2. 
XX 

PD 16-DAN-2003. 
XX 

PF 21-JUN-2002; 2002WO-US019669. 
XX 

PR 21-JUN-2001; 2001US-0299887P. 

PR 27-3UN-2001; 2001US-0301572P. 

PR 18-JUL-2001; 2001US-0306501P. 

PR 25-SEP-2001; 2001US-0325002P. 

PR 05-^4AR-2002; 2002US-0362585P. 

PR 14-MAy-2002; 2002US-0380391P. 
XX 

PA (MILL-) MILLENIUM PHARM INC. 
XX 

PI Li 1 lie 3, Gannavarapu M, Glatt K, Hoersh S, Kamatkar s; 

PI Mertens M, Monahan JE, Myer V, Wang Y, Xu Y, Zhao X, Meyers RE; 

PI Bast RC, Hortobagyi GN, Pusztai L, Meric F, Sahin A, Mil Is GB; 

XX 

DR WPI; 2003-210381/20. 

DR P-PSDB; ABR47551. 
XX 

PT Breast cancer diagnosis or treatment by comparing the level of expression 

PT of a marker in a patient sample with that in the control non-breast 

PT cancer sample. 
XX 

PS claim 1; SEQ ID NO 339; 128pp; English. 
XX 

cc The present invention describes a method for assessing whether a patient 

CC is afflicted with breast cancer. The method comprises comparing the level 

cc of expression of a marker (gene/polypeptide see ACC50076 to ACC50334 and 

cc ABR47386 to ABR47632) in a patient sample and the normal level of 

cc expression of the marker in a control non-breast cancer sample, where a 

cc significant increase in the level of expression of the marker in the 

cc patient sample and the normal level is an indication that the patient is 

cc afflicted with breast cancer. The breast cancer associated sequences from 

cc the present invention have cytostatic activities and can be used in gene 

cc therapy. The method is useful for diagnosing and treating breast cancer, 

cc N.B. Tne sequence data for this patent did not form part of the printed 

cc specification, but was obtained in electronic format directly from WIPO 

cc at ftp,wipo.int/pub/published_pct_sequences 
XX 

SQ sequence 861 BP; 261 A; 193 C; 204 G; 203 T; 0 u; 0 other; 

Query Match 100.0%; Score 861; DB 8; Length 861; 
Best Local similarity 100.0%; Pred. No. 5.1e-244; 

Matches 861; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

Db 1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

Qy 61 CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 
llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
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Db 


61 


Qy 


121 


Db 


121 


oy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 
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GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 

Mil I II II II II II I II II I IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 

GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 

ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
ATGGATGATGAAGATGATGATGACCATGT6GACAGCCAGGACTCCATTGACTCGAACGAC 

TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 

1 1 Mil III III II 1 1 Mi I II II I III! Ill I II Mill II I III II Mill I llllll 

TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 



GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTrCACT 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII 
GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 

CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 

IIIIIIIIIIMIIIMIIIIIIIIIIMIIIIIIMMIIIIIIIIIIIIIIIIIIIII 
CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 

AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 

IIIIMIIIIIII Mill llllll IMMIMIMIIIMIMIMIMMIMMMII 

AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 
CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 

IIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIIII 

CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 
GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 

llllllilllllllllllllllllllllllMIIIIIIMIIIIIIIIMIlllllllll 

GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 

AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 

IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 

CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 

AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIMIIIII 

AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 

GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 

lllll IIIIMIIIIIII I I IIIIIIIMIIIIIIIIIIIIIIIIIIIMIIII llllll 
GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 

AGTGCATCTTCTGAGGTCAAT 861 

IIIIIIMIIIIMMMIII 

AGTGCATCTTCTGAGGTCAAT 861 



180 
180 
240 
240 
300 
300 
360 
360 
420 
420 
480 
480 
540 
540 
600 
600 
660 
660 
720 
720 
780 
780 
840 
840 



RESULT 3 
ACF12916 

ID ACF12916 standard; cDNA; 861 BP. 
XX 

AC ACF12916; 
XX 

DT lO-SEP-2003 (first entry) 
XX 
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DE Human cervical cancer cell marker encoding cDNA SEQ ID NO: 175. 
XX 

KW Human; cervical cancer; cervical cancer marker; cancer therapy; 

KW detection; gene therapy; vaccine; gene; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO2002101075-A2. 
XX 

PD 19-DEC-2002. 
XX 

PF 12-JUN-2002; 2002WO-US018638. 
XX 

PR 13-3UN-2001; 2001US-0298155P. 

PR 13-3UN-2001; 2001US-0298159P. 

PR 14-NOV-2001; 2001US-0335936P. 

XX 

PA (MILL-) MILLENNIUM PHARM INC. 
XX 



PI Schlegel R, Chen Y, Zhao X, Monahan JE, Kamatkar s; 

PI Gannavarapu M, Glatt K, Hoersch S; 
XX 

OR WPI; 2003-156967/15. 

DR P-PSDB; ABR92133. 
XX 

PT New isolated nucleic acid molecule useful for detecting, characterizing, 

PT preventing and treating human cervical cancers, in various prognostic and 

PT diagnostic assays, in pharmacogenomics and in monitoring clinical trials. 



XX 

PS claim 4; Page 336; 386pp; English. 
XX 

CC ACF12828 to ACF12947 encode the human cervical cancer marker proteins (I) 
CC given in ABR92047 to ABR92164. A higher level of expression of (I) than 



CC normal indicates the presence of cervical cancer. Also described: (1) a 

CC vector (II) containing (I); (2) a host cell (III) containing (I); and (3) 

CC assessing (Ml) whether a patient is afflicted with cervical cancer, 

CC comprising comparing the level of expression of a marker in a patient's 

CC sample, and the normal level of expression of the marker in a control non 

CC -cervical cancer sample, where a significant increase in the level of 

CC expression of the marker in the patient's sample relative to that in the 



CC control sample is an indication that the patient is afflicted with 

CC cervical cancer. (I) has cytostatic activity, and can be used in gene 

CC therapy and in vaccines. (I) is useful in detecting, characterising, 

CC preventing and treating human cervical cancers. (I) may also be used in 

CC various prognostic and diagnostic assays, pharmacogenomics and in 

CC monitoring clinical trials 
XX 

SQ Sequence 861 BP; 261 A; 193 C; 204 G; 203 T; 0 U; 0 other; 

Query Match 100.0%; Score 861; DB 8; Length 861; 

Best Local similarity 100.0%; Pred. No. 5.1e-244; 

Matches 861; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIII 

Db 1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

Qy 61 CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIII 

Db 61 CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 

Qy 121 GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
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Db 


121 


GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 


180 


Qy 


181 


ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 


240 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

lllllllllllllllllillllllllllllillillllllllllllllllllllllllll 




Db 


181 


ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 


240 


Qy 


241 


TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 


300 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 




Db 


241 


TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGA6TCTCACCATTCTGAT 


300 


Qy 


301 


GAATCTGATGAACTGGTCACTGA 1 1 1 1 CCCACGGACCTGCCAGCAACCGAAG MM CACT 


360 




1 1 1 1 1 i 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


GAATCTGATGAACTGGTCACTGA I 1 1 ICCCACGGACCTGCCAGCAACCGAAGTTTTCACT 


360 


Qy 


361 


CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 


420 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
liiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 




Db 


361 


CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGT6GTTTATGGACTG 


420 


Qy 


421 


AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 


480 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 


480 


Qy 


481 


CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 


540 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


481 


CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 


540 


Qy 


541 


GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 


600 




1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 i 1 1 1 1 1 1 i 1 1 i 1 1 




Db 


541 


GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 


600 


Qy 


601 


AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 


660 




1 1 1 1 M 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 i 1 i 1 1 1 
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 




Db 


601 


AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 


660 


Qy 


661 


CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 


720 




1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


661 


CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACI I I CC 


720 


Qy 


721 


AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 


780 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


721 


AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 


780 


Qy 


781 


GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 


840 




IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIII 




Db 


781 


GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTT.CTCATGAATTAGAT 


840 


Qy 


841 


AGTGCATCTTCTGAGGTCAAT 861 






Illllllllllllllllllll 




Db 


841 


AGTGCATCTTCTGAGGTCAAT 861 





RESULT 4 
ADB75482 

ID ADB75482 Standard; cDNA; 861 BP. 

XX 

AC ADB75482; 
XX 

DT 04-DEC-2003 (first entry) 
XX 

DE Prostate cancer marker cDNA. 
XX 

KW Prostate; cancer; cytostatic; gene therapy; marker; ss. 
XX 
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OS Homo sapiens. 

XX 

PN WO2003009814-A2. 
XX 

PD 06-FEB-2003. 
XX 

PF 25-JUL-2002; 2002WO-US023913. 
XX 

PR 25-JUL-2001; 2001US-0307982P. 

PR 22-AUG-2001; 2001US-0314356P. 

PR 25-SEP-2001; 2001US-0325020P. 

PR 12-DEC-2001; 2001US-0341746P. 

PR 05-MAR-2002; 2002US-0362158P. 
XX 

PA (MILL-) MILLENNIUM PHARM INC. 

XX 

PI Schleqel R, Monahan JE, Endege WO, Gannavarapu M, Gorbatcheva B; 

PI Hoersn s, Kamatkar S, Wonsey AM, Glatt K, zhao x, Anderson D; 
XX 

DR WPI; 2003-248033/24. 
XX 

PT New nucleic acid molecule, useful for diagnosing or treating prostate 

PT cancer. 

XX 

PS Disclosure; SEQ ID NO 306; 99pp; English. 
XX 

cc The invention relates to newly discovered cancer markers associated with 

cc the cancerous state of prostate cells. Also disclosed is a method of 

CC assessing whether a patient is afflicted with prostate cancer. The method 

CC of the invention involves assessing whether a patient is afflicted with 

CC prostate cancer by comparing the level of expression of a marker in a 

CC patient sample and the normal level of expression of the marker in a 

CC control non-prostate cancer sample, where a significant increase in the 

CC level of expression of the marker in the patient sample and the normal 

CC level indicates that the patient is afflicted with prostate cancer. 

CC Nucleic acids of the invention are useful for diagnosing or treating 

CC prostate cancer, and may be useful in gene therapy. Sequences given in 

CC ADB75177-ADB75631 represent marker cDNA and proteins. Note: The sequence 

CC data for this patent did not form part of the printed specification, but 

CC was obtained in electronic format directly from WIPO at 

CC ftp . wi po . i nt/pub/publ i shed_pct_sequences . 
XX 

SQ Sequence 861 BP; 261 A; 193 C; 204 G; 203 T; 0 U; 0 Other; 

Query Match 100.0%; Score 861; DB 10; Length 861; 
Best Local Similarity 100.0%; Pred. No. 5.1e-244; 

Matches 861; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

Db 1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

Qy 61 CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 

IIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIilllllillMlilllllllllll 

Db 61 CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 

Qy 121 GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

Db 121 GACTTTAAACAAGAGACCCTTCCAA6TAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

Qy 181 ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 240 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

Db 181 ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 240 
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Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 



TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 

GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 3 60 
llllllllllllllllllllllllllllllllllllllilllllllllllllllllllll 
GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 3 60 

CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 
AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

lllillllllllllllllllllllllMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 5 40 
llllllllllilllllllllllllllllllllllllllllllllllllllllllllllll 
CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCC6TT 5 40 

GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

llllllllllllllllllllllllllllllllllllllllilllllllllllllllllll 
GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 6 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 
lllllllillllllllllllllllllllllllllllllllllllllllllllllllllll 
CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 

AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 
llllllllillllllllllllllllllllllllllllllllllllllllllllllllill 
AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 

GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 8 40 
llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 840 

AGTGCATCTTCTGAGGTCAAT 861 

Mill Mill MM II 1 1 III 

AGTGCATCTTCTGAGGTCAAT 861 



RESULT 5 
ADQ88469 
ID 
XX 



AC 
XX 
DT 

XX 
DE 
XX 
KW 
KW 
KW 
KW 
KW 
XX 
OS 



ADQ88469 Standard; DNA; 861 BP. 
ADQ88469; 

07-OCT-2004 (first entry) 

Human OPN-a isoform, osteopontin-c (OPN-c) DNA. 

Tumour; OPN-b; osteopontin-b; OPN-c; osteoponti n-c; carcinoma; sarcoma; 
leukaemia; lymphoma; prostate gland tumour; colon tumour; lung tumour; 
breast tumour; stomach tumour; bladder tumour; ovary tumour; 
thyroid gland tumour; pancreas tumour; liver tumour; glioma; therapy; 
human; osteopontin-a; OPN-a; gene; ds. 

Homo sapiens. 
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XX 

FH Key Location/Qualifiers 

FT CDS 1. .861 

FT /*tag= a 

FT /product= "Human OPN-a isoform, osteopontin-c (OPN-c) 

FT protein" 

FT /partial 

FT /note= "No stop codon" 

XX 

PN US2004142865-A1. 
XX 

PD 22-JUL-2004. 
XX 

PF 02-OCT-2003; 2003US-00678355 . 
XX 

PR 02-OCT-2002; 2002US-0415712P. 
XX 

PA (WEBE/) WEBER G F. 
XX 

PI Weber GF; 
XX 

DR WPI; 2004-533520/51. 

DR P-PSDB; ADQ88475. 

DR GENBANK; D28761. 
XX 

PT Treating patient having tumor, or is at risk of developing a tumor e.g., 

PT sarcoma, carcinoma, leukemia, involves administering an agent that 

PT inhibits expression or activity of osteopontin-b or osteopontin-c, to the 

PT patient. 

XX 

PS claim 5; SEQ ID NO 5; 21pp; English. 
XX 

cc The present invention relates to a method of treating a patient who has a 

CC tumour or who is at risk of developing a tumour. The method involves 

CC administering to the patient an agent that inhibits the expression or 

CC activity of at least one of osteopontin-b or osteopontin-c (oPN-b or OPN- 

CC c). The invention is useful for treating tumour such as carcinoma, 

CC sarcoma, leukaemia or lymphoma and tumour of the prostate gland, colon, 

CC lung, breast, stomach, bladder, ovary, thyroid gland, pancreas or liver 

CC and preferably for treating a patient having glioma. The present sequence 

CC is human osteopontin-a (OPN-a) isoform, OPN-c DNA. 

XX 

SQ sequence 861 BP; 261 A; 193 C; 204 G; 203 T; 0 U; 0 other; 

Query Match 100.0%; score 861; DB 13; Length 861; 
Best Local similarity 100.0%; Pred. No. 5.1e-244; 

Matches 861; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

nh 1 UiiiiiliUiU' 1 'I' ' I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

Qy 61 CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 

I I I I I I I I I I I I I i I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 CAGGCTGATTCT6GAAGTTCTGAGGAAAAGCAGAAT6CTGTGTCCTCTGAAGAAACCAAT 120 

Qy 121 GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

NlllllllllllliMlllllllillllllllillllllllllllllllllllllllll 

Db 121 GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

Qy 181 ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 240 

^. Mil Mill II I Mill I llllll Mill MM III III II lllll II MM 1 1 mill 

Db 181 ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 240 
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Qy 


241 


Ob 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 



us-10-678-355-5.rng 

TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 

GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 360 

CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 5 40 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 5 40 

GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

llllllllllllllllllllllllllllllllllllllllilllllllllllllllllll 
AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 

AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 

Mllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 

GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 8 40 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 8 40 

AGTGCATCTTCTGAGGTCAAT 861 
llllllllllillllllllll 
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GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



CM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



Searched: 



May 5. 2005, 02:55:48 



Search time 194 Seconds 
(without alignments) 
7262.032 Million cell updates/sec 



US-10-678-355-5 
861 

1 atgagaattgcagtgatttg gtgcatcttctgaggtcaat 861 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



1202784 seqs, 818138359 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



2405568 



Post-processing: 



Database 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

lssued_Patents_NA: * 



/cgn2_6/ptodata/l/i na/5A^C0MB . seq : * 
/cgn2_6/ptodata/l/i na/5B_C0MB . seq : * 
/cgn2_6/ptodata/l/i na/6A^C0MB . seq : * 
/eg n2_6/ptodata/l/i na/6B_C0MB . seq : * 
/cgn2_6/ptodata/l/i na/PCTUS_COMB . seq : * 
/cgn2_6/ptodata/l/i na/backf i 1 esl . seq : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result Query 

No. Score Match Length DB 



ID 



Description 



1 


772 


89.7 


1509 


4 


US-09-949- 


•016-4492 


Sequence 


4492, Ap 
91, Appl 


2 


772 


89.7 


1524 


4 


US-09-949- 


•016-91 


Sequence 


3 


768.8 


89.3 


1493 


6 


5340934-5 




Patent No. 


5340934 


4 


768.8 


89.3 


1493 


6 


5340934-5 




Patent No. 


5340934 


5 


766.2 


89.0 


1422 


3 


US-09-485- 


-077A-15 


Sequence 


15, Appl 


6 


766.2 


89.0 


1424 


4 


US-09-206- 


-576-1 


Sequence 


1, Appli 


7 


403 


46.8 


601 


4 


US-09-949- 


■016-19586 


Sequence 


19586, A 


8 


403 


46.8 


601 


4 


US-09-949- 


■016-160441 


Sequence 


160441, 


9 


401.8 


46.7 


11683 


4 


US-09-949- 


-016-11833 


Sequence 


11833, A 


10 


401.8 


46.7 


11689 


4 


US-09-949- 


-016-16234 


Sequence 


16234, A 


11 


328.6 


38.2 


601 


4 


US-09-949- 


-016-19585 


Sequence 


19585, A 


12 


328.6 


38.2 


601 


4 


US-09-949- 


-016-160440 


Sequence 


160440, 


13 


317.2 


36.8 


1430 


4 


US-09-774- 


-528-441 


Sequence 


441, App 


14 


296.2 


34.4 


1473 


3 


US-09-485- 


-077A-16 


Sequence 


16, Appl 


15 


261.4 


30.4 


1357 


6 


5340934-7 




Patent No. 


5340934 


16 


261.4 


30.4 


1357 


6 


5340934-7 




Patent No. 


5340934 


c 17 


227 


26.4 


241 


4 


US-09-389- 


-681-315 


Sequence 


315, App 
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c 


18 


227 


26.4 


c 


19 


227 


26.4 


c 


20 


227 


26.4 


c 


21 


227 


26.4 


c 


22 


227 


26.4 


c 


23 


227 


26.4 




24 


216.8 


25.2 




25 


216.8 


25.2 




26 


200.4 


23.3 




27 


200.4 


23.3 




28 


200.4 


23.3 




29 


200.4 


23.3 




30 


200.4 


23.3 




31 


200.4 


23.3 




32 


200.4 


23.3 


c 


33 


121 


14.1 




34 


82 


9.5 




35 


82 


9.5 




36 


59 


6.9 




37 


59 


6.9 




38 


59 


6.9 




39 


59 


6.9 




40 


59 


6.9 




41 


59 


6.9 




42 


59 


6.9 




43 


58.4 


6.8 


c 


44 


58.4 


6.8 




45 


56.4 


6.6 



241 
241 
241 
241 
241 
241 
226 
226 
241 
241 
241 
241 
241 
241 
241 
557 
601 
601 
241 
241 
241 
241 
241 
241 
241 
60 
75 
601 



4 
4 
4 
4 
4 
4 
3 
3 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 



us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 
us- 



10-678- 
■09-620 
-09-433 
-09-604 
-09-834 
-09-590 
-09-551 
-09-712 
-09-712 
-09-389 
-09-620 
■09-433 
■09-604 
■09-834 
■09-590 
-09-551 
-09-451 
-09-949 
■09-949 
■09-389 
■09-620 
■09-433 
•09-604 
•09-834 
•09-590 
•09-551 
■09-888 
09-888 
■09-949 



355-5. rni 
■405B-315 
-826B-315 
287A-315 
759-315 
751A-315 
621-315 
016-64 
016-70 
681-398 
405B-398 
826B-398 
287A-398 
759-398 
751A-398 
621-398 
651-6 
016-19584 
016-160439 
681-316 
405 B- 3 16 
826B-316 
287A-316 
759-316 
751A-316 
621-316 
413-89 
413-91 
016-19578 



Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 



315, App 
315, App 
315, App 
315, App 
315, App 

315, App 
64, App) 
70, Appl 
398, App 
398, App 
398, App 
398, App 
398, App 
398, App 
398, App 
6, Appli 
19584, A 
160439, 

316, App 
316, App 
316, App 
316, App 
316, App 
316, App 
316, App 
89, Appl 
91, App1 
19578, A 



ALIGNMENTS 



RESULT 1 

US-09-949-016-4492 

; Sequence 4492, Application US/09949016 

; Patent No. 6812339 

; GENERAL INFORMATION: 

; APPLICANT: VENTER, J. Craig et al . 

; TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

; TITLE OF invention: with HUMAN DISEASE, METHODS OF DETECTION AND USES THEREOF 

; FILE REFERENCE: CL001307 

; CURRENT APPLICATION NUMBER: US/09/949 , 016 

; CURRENT FILING DATE: 2000-04-14 

; PRIOR APPLICATION NUMBER: 60/241,755 

; PRIOR FILING DATE: 2000-10-20 

; PRIOR APPLICATION NUMBER: 60/237,768 

; PRIOR FILING DATE: 2000-10-03 

; PRIOR APPLICATION NUMBER: 60/231,498 

; PRIOR FILING DATE: 2000-09-08 

; NUMBER OF SEQ ID NOS: 207012 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 4492 

; LENGTH: 1509 

; type: dna 

; ORGANISM: Human 

US-09-949-016-4492 

Query Match 89.7%; Score 772; DB 4; Length 1509; 

Best Local Similarity 92.9%; Pred. No. 4.2e-233; 

Matches 836; Conservative 0; Mismatches 25; Indels 39; Gaps 1; 

Qy 1 ATGAGAATTGCAGTGATTTGC I I I I GCCTCCTAGGCATCACCTGT6CCATACCAGTTAAA 60 
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202 
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682 
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808 
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742 
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868 


Qy 


802 


Db 


928 



us-10-678-355-5.rm 

llllllillllilllllllllllllllllllllllllllllllllillllllllllllll 
ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 147 

CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAG 93 

lllllllllllllllllltllllllllllllll 

CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGCTTTACAACAAATACCCAGATGCTGTG 207 

AATGCTGTGTCCTCTGAAGAAACCAATGACTTTAAACAAGAGACCCTT 141 

II III I MM I III I I II llllllll 
GCCACATGGCTAAACCCTGACCCATCTCAGAAGCAGAATCTCCTAGCCCCACAGACCCTT 267 

CCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGATATGGATGATGAAGATGATGAT 201 

IIMIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIllllllllllllll 

CCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGATATGGATGATGAAGATGATGAT 327 
GACCATGTGGACAGCCAGGACTCCATTGACTCGAACGACTCTGATGATGTAGATGACACT 261 

IMIIIIIIIIIIIIIIIIIIIIIIII IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIII 

GACCATGTGGACAGCCAGGACTCCATTGACTCGAACGACTCTGATGATGTAGATGACACT 387 
GATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGATGAATCTGATGAACTGGTCACT 321 

IIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

GATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGATGAATCTGATGAACTGGTCACT 447 

GATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACTCCAGTTGTCCCCACAGTAGAC 381 

lllllllllllllllllllllllllllllllllllllllllllllllllllllillllll 
GATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACTCCTVGTTGTCCCCACAGTAGAC 507 

ACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTGAGGTCAAAATCTAAGAAGTTT 441 

llllllll III II III II I II I I II I II III INI I Mill I MM I III II II I I III I 
ACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTGAGGTCAAAATCTAAGAAGTTT 567 

CGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAGCACATCACCTCACACATGGAA 501 

IIIIIIIIIIIMIIIIIMIillllllMIIIIIIIII IIIMIIIIIIIIIIIIIII 

CGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAGGACATCACCTCACACATGGAA 627 
AGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTTGCCCAGGACCTGAACGCGCCT 561 

IIMIIIMIIIIIIIIIMIIIIIIIIIMIIIIIIIIIMIIMIIIIMIIMMM 

AGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTTGCCCAGGACCTGAACGCGCCT 687 
TCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACGAGTCAGCTGGATGACCAGAGT 621 

MM MM Mill III II III Mill II III II MM III II III Mill llllll III I 

TCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACGAGTCAGCTGGATGACCAGAGT 747 

GCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAGCGGAAAGCTAATGATGAGAGC 681 

llllll IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMII llllllllllll ^ 
GCTGAAACCCACAGCCACAAGCAGTCCAGATTATATAAGCGGAAAGCCAATGATGAGAGC 807 

AATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCCAAAGTCAGCCGTGAATTCCAC 741 

II I II II II II II II II I II M I II II I II II II II I II II II II II II II II I II I I II 
AATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCCAAAGTCAGCCGTGAATTCCAC 867 

AGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTAGACCCCAAAAGTAAGGAAGAA 801 

IIIIMIIIIIIIIIIIIIMIIII IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

AGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTAGACCCCAAAAGTAAGGAAGAA 927 
GATAAAC ACCTG AAATTTCGTATTTCTCATG AATTAGATAGTGCATCTTCTGAGGTCAAT 861 

II ill! MMIIIIIIIII llllll MM IIIMIIMI II Ml III III M llllll II 

GATAAACACCTGAAATTTCGTATTTCTCATGAATTAGATAGTGCATCTTCTGAGGTCAAT 987 
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us-10-678-355-5.rnpb 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 
Title: 

Perfect score: 
Sequence: 

Scoring table: 
Searched: 



May 5, 2005, 05:07:26 



Search time 5671 Seconds 
(without alignments) 
928.345 Million cell updates/sec 



US-10-678-355-5 
861 

1 atgagaattgcagtgatttg gtgcatcttctgaggtcaat 861 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



5654200 seqs, 3057283753 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



11308400 



Post-processi ng : 



Database 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Publ i shed^Appl i cati ons_NA : * 

1 : /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB . seq : * 

2 : /cgn2_6/ptodata/2/pubpna/PCT«NEW_PUB . seq : * 

3 : /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB . seq : * 

4 : /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB . seq : * 

5 : /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB . seq : * 

6 : /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB . seq : 

7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB . seq : * 

8 : /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB . seq : * 

9 : /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB . seq : 

10 : /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB . seq 

11: /cgn2_6/ptodata/2/pubpna/US09c_PUBCOMB.seq 

12 : /cgn2_6/ptodata/2/pubpna/us09_NEW_PUB . seq : 

13 : /cgn2_6/ptodata/2/pubpna/USlOA_PUBCOMB . seq 

14: /cgn2_6/ptodata/2/pubpna/USlOB_PUBCOMB.seq 

15 : /cgn2_6/ptodata/2/pubpna/USlOC_PUBCOMB . seq 

16 : /cgn2_6/ptodata/2/pubpna/USlOD_PUBCOMB . seq 

17 : /cgn2_6/ptodata/2/pubpna/USlOE_PUBCOMB . seq 

18 : /cgn2_6/ptodata/2/pubpna/USlOF_PUBCOMB . seq 

19 : /cgn2_6/ptodata/2/pubpna/USlO_NEW_PUB . seq : 

20 : /cgn2_6/ptodata/2/pubpna/USlLNEW_PUB . seq : 

21: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq: 

22 : /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB . seq : 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



% 

Result Query 

No. Score Match Length DB ID 



SUMMARIES 



Description 

Sequence 230, App 



861 100.0 861 14 US-10-097-340-230 

Page 1 













us-10-678-355-5.rnpb 


2 


861 


100.0 


861 


14 


US-10-171-311-175 


3 


861 


100. 


0 


861 


15 


US-10-205-823-306 


4 


861 


100. 


0 


861 


15 


US-10-177-293-339 


5 


861 


100. 


0 


861 


15 


US-10-301-822-146 


6 


861 


100. 


0 


861 


18 


US-10-678-355-5 


7 


861 


100. 


0 


861 


18 


US-10-775-920-217 


8 


778.4 


90. 


4 


900 


18 


US-10-678-355-3 


9 


778.4 


90. 


4 


900 


18 


US-10-775-920-218 


10 


772 


89. 


7 


1447 


17 


US-10-240-425-1270 


11 


772 


89. 


7 


1447 


18 


US-10-717-597-188 


12 


772 


89.7 


1447 


18 


US-10-775-169-221 


13 


772 


89. 


7 


1447 


18 


US-10-775-920-209 


14 


772 


89. 


7 


1469 


9 


Us-09-919-770-1 


15 


772 


89. 


7 


1469 


19 


US-10-690-880-9 


16 


772 


89. 


7 


1486 


18 


US-10-775-920-210 


17 


772 


89. 


7 


1520 


18 


US-10-775-920-207 


18 


772 


89. 


7 


1524 


14 


US-10-097-340-228 


19 


772 


89. 


7 


1524 


14 


US-10-171-311-173 


20 


772 


89. 


7 


1524 


15 


US-10-205-823-304 


21 


772 


89. 


7 


1524 


15 


US-10-177-293-337 


22 


772 


89. 


7 


1524 


15 


US-10-301-822-144 


23 


772 


89. 


7 


1524 


16 


US-10-269-909-70 


24 


772 


89. 


7 


1524 


17 


US-10-170-385-268 


25 


772 


89. 


7 


1524 


17 


US-10-295-027-507 


26 


772 


89. 


7 


1524 


17 


US-10-295-027'1116 


27 


772 


89. 


7 


1524 


17 


US-10-087-080-11 


28 


772 


89. 


7 


1524 


17 


US-10-240-425-392 


29 


772 


89. 


7 


1524 


17 


US-10-229-345-6 


30 


772 


89. 


7 


1524 


17 


US-10-274-177-6 


31 


772 


89. 


7 


1524 


18 


US-10-717-597-279 


32 


772 


89. 


7 


1524 


18 


US-10-650-112-6 


33 


772 


89. 


7 


1524 


18 


US-10-764-649-7 


34 


772 


89. 


7 


1524 


18 


Us-io-734-564-8 


35 


772 


89. 


7 


1524 


18 


US-10-775-920-208 


36 


772 


89. 


7 


1524 


19 


US-10-278-698-282 


37 


772 


89. 


7 


1524 


19 


US-10-278-698-796 


38 


772 


89. 


7 


1524 


19 


US-10-818-066-1 


39 


772 


89. 


7 


1549 


18 


US-10-775-920-215 


40 


772 


89. 


7 


1734 


18 


US-10-723-860-7782 


41 


772 


89. 


7 


2209 


14 


US-10-198-846-13927 


42 


771 


89. 


5 


942 


14 


US-10-097-340-226 


43 


771 


89. 


5 


942 


14 


US-10-171-311-171 


44 


771 


89. 


5 


942 


IS 


US-10-205-823-302 


45 


771 


89. 


5 


942 


15 


US-10-177-293-335 



Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
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175, App 
306, App 
339, App 
146, App 

5, Appli 

217, App 
3, Appli 

218, App 
1270, Ap 
188, App 
221, App 

209, App 
1, Appli 

9, Appli 

210, App 

207, App 
228, App 
173, App 
304, App 
337, App 
144, App 
70, Appl 
268, App 
507, App 
1116, Ap 
11, Appl 
392, App 

6, Appli 
6, Appli 
279, App 

6, Appli 

7, Appli 

8, Appli 

208, App 
282, App 
796, App 
1, Appli 
215, App 
7782, Ap 
13927, A 
226, App 
171, App 
302, App 
335, App 



ALIGNMENTS 



RESULT 1 

US-10-097-340-230 

Sequence 230, Application US/10097340 
Publication No. US20030087250A1 
GENERAL INFORMATION: 
APPLICANT: John MONAHAN 
APPLICANT: Manjula GANNAVARAPU 

APPLICANT: Sebastian HOERSCH 

APPLICANT: shubhangi KAMATKAR 

APPLICANT: Steve G. KOVATS 

APPLICANT: Rachel E. MEYERS 

APPLICANT: Michael MORRISEY 

APPLICANT: Peter OLANDT 

APPLICANT: Ami SEN 
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us-10-678-355-5.rnpb 

APPLICANT: Peter VEIBY 
APPLICANT: Gordon B. MILLS 
APPLICANT: Robert c. bast, 3r. 
APPLICANT: Karen LU 
APPLICANT: Rosemarie SCHMANDT 
APPLICANT: xumei ZHAO 
APPLICANT: Karen GLATT 

TITLE OF INVENTION: Nuclei c Acid Molecules and Proteins For The identification, 
TITLE OF INVENTION: Assessment, Prevention, and Therapy of ovarian Cancer 
FILE REFERENCE: MRI-030 

CURRENT APPLICATION NUMBER: US/10/097 , 340 
CURRENT FILING DATE: 2002-03-14 
PRIOR APPLICATION NUMBER: 60/276,025 
PRIOR FILING DATE: 2001-03-14 
PRIOR APPLICATION NUMBER: 60/325,149 
PRIOR FILING DATE: 2001-09-26 
PRIOR APPLICATION NUMBER: 60/276,026 
PRIOR FILING DATE: 2001-03-14 
PRIOR APPLICATION NUMBER: 60/324,967 
PRIOR FILING DATE: 2001/09/26 
PRIOR APPLICATION NUMBER: 60/311,732 
PRIOR FILING DATE: 2001-08-10 
PRIOR APPLICATION NUMBER: 60/325,102 
PRIOR FILING DATE: 2001-09-26 
PRIOR APPLICATION NUMBER: 60/323,580 
PRIOR FILING DATE: 2001-09-19 
NUMBER OF SEQ ID NOS: 363 

SOFTWARE: FastSEQ for Windows version 4.0 
SEQ ID NO 230 
LENGTH: 861 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-097-340-230 

Query Match 100.0%; Score 861; DB 14; Length 861; 

Best Local Similarity 100.0%; Pred. No. 4.6e-255; 

Matches 861; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

U^iMiUiJ-iUiiiU' ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 
CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 

MIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIMIII 

CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 12 0 
GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

llllllllllllllillMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 
ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 2 40 

lilllllillllMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 2 40 

TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 
lllllllllllilllllllllllllllllllllllllllllllllillllllllllllll 
TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 

GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 3 60 

IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 360 
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1 


Qy 


61 
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61 


Qy 


121 


Db 


121 


Qy 


181 
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181 


Qy 


241 
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241 


Qy 


301 


Db 


301 


Qy 


361 



us-10-678-355-5.rnpb 

illlllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

II I II II 1 1 1 II MM I II III Mini Mill III I I I I llllllllllllllllllll 
AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 5 40 

IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 5 40 

GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

IMIIIIIIIMIIIIMIIIMMIIIIIIIIIIIIIIIIII lllllllllllllllll 

GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 
AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

IIIIMIIIIIIIIIMIIMIIIIII IMIIIMIIIIIIMMIII MM IIIMIM 

AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 

AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 
IIIIIIIIIIIIIIIIIIIMII IIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIII 



lllllllllllllllllllllllllllllllllllllllllllllll IIMIIIMIIII 

GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 8 

AGTGCATCTTCTGAGGTCAAT 861 

lllllllllllllllllllll 
AGTGCATCTTCTGAGGTCAAT 861 

RESULT 2 

US-10-171-311-175 

Sequence 175, Application US/10171311 
Publication No. US20030087270A1 
GENERAL INFORMATION: 
APPLICANT: Schlegel, Robert 
APPLICANT: Chen, Yan 
APPLICANT: Zhao, Xumei 
APPLICANT: Monahan, John 
APPLICANT: Kamatkar, shubhangi 
APPLICANT: Glatt, Karen 
APPLICANT: Gannavarapu, Manjula 
APPLICANT: Hoersh, Sebastian 

TITLE OF INVENTION: NOVEL GENES, COMPOSITIONS, KITS, AND METHODS FOR 
TITLE OF INVENTION: IDENTIFICATION, ASSESSMENT, PREVENTION, AND THERAPY 
TITLE OF INVENTION: OF CERVICAL CANCER 
FILE REFERENCE: MRI-035 

CURRENT APPLICATION NUMBER: US/10/171, 311 
CURRENT FILING DATE: 2002-06-12 
PRIOR APPLICATION NUMBER: US 60/298,159 
PRIOR FILING DATE: 2001-06-13 
PRIOR APPLICATION NUMBER: US 60/298,155 
PRIOR FILING DATE: 2001-06-13 
PRIOR APPLICATION NUMBER: US 60/335,936 
PRIOR FILING DATE: 2001-11-14 
NUMBER OF SEQ ID NOS: 238 
SOFTWARE: FastSEQ for Windows Version 4.0 
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us-10-678-355-5.rnpb 

; SEQ ID NO 175 
; LENGTH: 861 
; TYPE: DNA 

; ORGANISM: Homo sapiens 
US-10-171-311-175 



Query Match 100.0%; Score 861; DB 14; Length 861; 

Best Local Similarity 100.0%; Fred. No. 4.6e-255; 

Matches 861; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


ATGAGAATTGCAGTGATTTGC 1 1 1 IGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 


60 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 


ATGAGAATTGCAGTGATTTGCI 1 1 1 GCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 


60 


Qy 


61 


CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 


120 




IlllCllftllllllllllilllllllllllllllllllllllllllllllllllllllli 

1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 


120 


Qy 


121 


GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 


180 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 


180 


Qy 


181 


ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 


240 




iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 

#1 1 XJ \Jf \ 1 \ 1 \^r\I\\Jr^ 1 XJ/^ 1 KJf^ 1 XJ#\X* x*/^ 1 XJ 1 XJ XJ/^ X*/^XJ ^ XJ XJ/^X- 1 X* ^^/^ 1 1 XJAiX^ 1 XJ Avr^ XJ#%X* 


240 


Qy 


. 241 


TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 


300 




iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 M M 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 




Db 


241 


TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 


300 


Qy 


301 


GAATCTGATGAACTGGTCACTGAI 1 1 1 CCCACGGACCTGCCAGCAACCGAAGTTTTCACT 


360 




iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiitiiiiiiiiiiiii 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


GAATCTGATGAACTGGTCACTGA 1 1 1 1 CCCACGGACCTGCCAGCAACCGAAG MM CACT 


360 


Qy 


361 


CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 


420 




iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 


420 


Qy 


421 


AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 


480 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 


480 


Qy 


481 


CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 


540 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


481 


CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 


540 


Qy 


541 


GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 


600 




IIIIIIIMIIIIIIIIIIIIIIIMIIIMIIIIIIIIMMIIIIIIIIIIIIIIIII 




Db 


541 


GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 


600 


Qy 


601 


AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 


660 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


601 


AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 


660 


Qy 


661 


CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 


720 




llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


661 


CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 


720 


Qy 


721 


AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 


780 


Db 


721 


llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 


780 


Qy 


781 


GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 


840 
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lllllillllllllllllllllllllllllllllllllllllllilllllllllllllll 
Db 781 GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 840 

Qy 841 AGTGCATCTTCTGAGGTCAAT 861 

I III I I II III II lllll Mi 

Db 841 AGTGCATCTTCTGAGGTCAAT 861 

RESULT 3 

US-10-205-823-306 

Sequence 306, Application US/10205823 
Publication No. US20030108963A1 
GENERAL INFORMATION: 

APPLICANT: schleqel, Robert 
APPLICANT: Monahan, John E. 
APPLICANT: Endege, Wilson 0. 
APPLICANT: Gannavarapu, Maniula 
APPLICANT: Gorbatcheva, Bella 
APPLICANT: Hoersch, Sebastian 
APPLICANT: Kamatkar, shubhangi 
APPLICANT: Wonsey, Angela M. 
APPLICANT: Glatt, Karen 
APPLICANT: Zhao, xumei 
APPLICANT: Anderson, Dustin 

TITLE OF INVENTION: NOVEL GENES, COMPOSITIONS, KITS, AND 

TITLE OF INVENTION: METHODS FOR IDENTIFICATION, ASSESSMENT, PREVENTION, AND 
TITLE OF INVENTION: THERAPY OF PROSTATE CANCER 
FILE REFERENCE: MRI-044 

CURRENT APPLICATION NUMBER: US/10/205,823 
CURRENT FILING DATE: 2002-07-25 
PRIOR APPLICATION NUMBER: 60/307,982 
PRIOR FILING DATE: 2001-07-25 
PRIOR APPLICATION NUMBER: 60/314,356 
PRIOR FILING DATE: 2001-08-22 
PRIOR APPLICATION NUMBER: 60/325,020 
PRIOR FILING DATE: 2001-09-25 
PRIOR APPLICATION NUMBER: 60/341,746 
PRIOR FILING DATE: 2001-12-12 
PRIOR APPLICATION NUMBER: 60/362,158 
PRIOR FILING DATE: 2002-03-05 
NUMBER OF SEQ ID NOS : 455 

SOFTWARE: FastSEQ for windows Version 4.0 
SEQ ID NO 306 
LENGTH: 861 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-205-823-306 

Query Match 100.0%; Score 861; DB 15; Length 861; 

Best Local Similarity 100.0%; Pred. No. 4.6e-255; 

Matches 861; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllilllll 
Db 1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

Qy 61 CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 

IIIMIIIIIIIIIIIIIIIIIIIIIIlllllllllllllllllllllllllllllllll 
Db 61 CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 

Qy 121 GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

II II I I II III III II II III I MM I I II lllll III III I II MM II I II III II II 

Db 121 GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 
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Oy 181 ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 240 

_ lllllllllillllllllllllllllllllllllllllllllllllllllllllllllll 

Db 181 ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 240 

Qy 241 TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 

lllilllllllllllllllllllllllllllllillMlllllllllllllllliillll 

Db 241 TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 

Qy 301 GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 360 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 

Db 301 GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 360 

Qy 361 CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

. lllllllllllllillMIIIIIIMIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIII 

Db 361 CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

Qy 421 AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

IIMMIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIllll 

Db 421 AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

Qy 481 CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 1 1 1 1 1 
Db 481 CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 5.40 

Qy 541 GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

^. IMIIIIIIIIIIIIIIIIIIIIlllllllllllllllMlllllliiiiiiiiiiiiii 

Db 541 GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

Qy 601 AGTC^GCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGAT^^ 660 

^^1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 601 AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCC^GATTATAW^ 660 

Qy 661 CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGG^ 720 

^. 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Do 661 CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 

Qy 721 AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGT^^ 780 

^,1 I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 1 1 
Db 721 AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 

Qy 781 GACCCCAAAAGTMGGAAGAAGATAAACACCTGAAATTTCGTAT^ 840 

Db 781 GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 840 

Qy 841 AGTGCATCTTCTGAGGTCAAT 861 

„^ lllllllllllllllllllll 
Db 841 AGTGCATCTTCTGAGGTCAAT 861 

RESULT 4 

US-10-177-293-339 

Sequence 339, Application US/10177293 
Publication No. US20030124128A1 
GENERAL INFORMATION: 
APPLICANT: Li Hie, 3ames 
APPLICANT: Glatt, Karen 
APPLICANT:- Zhao, Xumei 
APPLICANT: Gannavarpu, Manjula 
APPLICANT: Kamatkar, shubhangi 
APPLICANT: Martens, Maureen 
APPLICANT: Myer, Vic 
APPLICANT: Wang, Youzhen 
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APPLICANT: xu, Yongyao 
APPLICANT: Hoersch, Sebastian 
APPLICANT: Monahan, John 
APPLICANT: Meyers, Rachel E. 
APPLICANT: Bast Jr., Robert C. 
APPLICANT: Hortobagyi , Gabriel N. 
APPLICANT: Pusztai, Lajos 
APPLICANT: Meric, Funda 
APPLICANT: sahin, Aysegul 
APPLICANT: Mills, Gordon B. 

TITLE OF INVENTION: COMPOSITIONS, KITS, AND METHODS FOR IDENTIFICATION, 
ASSESSMENT, 

TITLE OF INVENTION: PREVENTION, AND THERAPY OF BREAST CANCER 
FILE REFERENCE: MRI-038 

CURRENT APPLICATION NUMBER: US/10/177 , 293 
CURRENT FILING DATE: 2002-06-21 
PRIOR APPLICATION NUMBER: US 60/299,887 
PRIOR FILING DATE: 2001-06-21 
PRIOR APPLICATION NUMBER: US 60/301,572 
PRIOR FILING DATE: 2001-06-27 
PRIOR APPLICATION NUMBER: US 60/306,501 
PRIOR FILING DATE: 2001-07-18 
PRIOR APPLICATION NUMBER: US 60/325,002 
PRIOR FILING DATE: 2001-09-25 
PRIOR APPLICATION NUMBER: US 60/362,585 
PRIOR FILING DATE: 2002-03-05 
PRIOR APPLICATION NUMBER: US 60/xxX,XXX 
PRIOR FILING DATE: 2002-05-14 
NUMBER OF SEQ ID NOS: 506 
SOFTOARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 339 
LENGTH: 861 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-lO-177-293-339 

Query Match 100.0%; Score 861; DB 15; Length 861; 

Best Local Similarity 100.0%; Pred. No. 4.6e-255; 

Matches 861; Conservative 0; Mismatches 0; Indels 0; Gaps 



0; 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 



llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 
CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 

lllllllllllllllll Mill Mill I'llllMIIIIIIIIIIIIIIIIIIIIIIIIM 

CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 

GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 

lljlllllllllllllllllllilMIIIIIMIMIIIIIIIIIIIIIIIIIIIIIMI 
GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 

ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIII 
ATGGATGATGAAGATGATGATGACCATGTGGACAGCCA6GACTCCATTGACTCGAACGAC 

TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIII 
TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 



GAATCTGATGAACTGGTCACTGATT 
lllllllllllllll 



TCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 
llllllllllllllllllllllllllllllllll 



120 
120 
180 
180 
240 
240 
300 
300 
360 



GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 360 
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CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

lllllllllllllllllllllllllllllllllllillllllllllllllllllllllll 
CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 5 40 

IIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIMMIIIIM 

CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 5 40 
GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 

GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 
AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

lllllllllllllllllllllllllllillillllllllllllMIIIIIIIIIIMIII 

AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 
CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 

llllllll MINI III II III II Mill I INI I IMII Mill I INI I II III I II I 

CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 
AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 

MMMMMIMIMIIMIMIMMMMIIMIMIMMIMIMMMIIIMI 

AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 
GACCCCAAAAGTAAGGAAGAAGATAAACACCTG AAATTTCGTATTTCTC ATGAATTAG AT 8 40 

I M M I II II I M I M I II M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II M I M 1 1 1 1 1 M II 1 1 1 1 1 1 II 

GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 840 

AGTGCATCTTCTGAGGTCAAT 861 
lllllllllllllllllllll 



RESULT 5 

US-10-301-822-146 

sequence 146, Application US/10301822 
Publication No. US20030148410A1 
GENERAL INFORMATION: 

APPLICANT: Millennium Pharmaceuticals, Inc. 
APPLICANT: Berger, Allison 
APPLICANT: Guillemette, Tracy L. 
APPLICANT: Kamatkar, shubhangi 
APPLICANT: schlegel, Robert 
APPLICANT: Monahan, John E. 
APPLICANT: Thibodeau, Stephen N. 
APPLICANT: Burgart, Lawrence J. 

TITLE OF INVENTION: NOVEL GENES, COMPOSITIONS, KITS, AND 

TITLE OF INVENTION: METHODS FOR IDENTIFICATION, ASSESSMENT, PREVENTION, AND 
TITLE OF INVENTION: THERAPY OF COLON CANCER 
FILE REFERENCE: MPM01-029P2RNM 
CURRENT APPLICATION NUMBER: US/10/301, 822 
CURRENT FILING DATE: 2002-11-21 
PRIOR APPLICATION NUMBER: US 60/339,971 
PRIOR FILING DATE: 2001-12-10 
PRIOR APPLICATION NUMBER: US 60/361,978 
PRIOR FILING DATE: 2002-03-05 
PRIOR APPLICATION NUMBER: US 60/381,988 
PRIOR FILING DATE: 2002-05-20 
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Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 



us-10-678-355-5.rnpb 

NUMBER OF SEQ ID NOS: 228 
SOFTV/ARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 146 
LENGTH: 861 
TYPE: DNA 

ORGANISM: Homo Sapiens 

FEATURE : 
NAME/KEY: CDS 
LOCATION: (1)...(861) 
US-10-301-822-146 

Query Match 100.0%; Score 861; DB 15; Length 861; 

Best Local similarity 100.0%; Pred. No. 4.6e-255; 

Matches 861; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIMIM 

ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 
CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 12 0 

llllllllllilllllllllllllllllllllllllllllMlllllillllllllllM 

CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 
GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

lllilillMlllllllllllllllllllllillllllllllllllllllllllllilll 

GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 
ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 2 40 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIill 

ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 2 40 
TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 

IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIII 

TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 
GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 3 60 

IIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIllllllllM 

GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGC AACCG AAGTTTTCACT 360 

CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 42 0 
llilllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII 

AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 5 40 
illlllllllllllllllllllllllllllllilllllllllllllllllllllllllll 
CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 5 40 

GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 
llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIMIIIIIIIIIIMIIIIMMIIII 

AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 
llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 

Page 10 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 



us-10-678-355-5.rnpb 
QV 721 AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 721 AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 

oy 781 GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 840 

IIIIIIIIIIIIIIIIIIIIIMIIIIIilllllMIIMIIIIIIIIIIMIIIIIIII 

Db 781 GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 840 

Qy 841 AGTGCATCTTCTGAGGTCAAT 861 

lllllllllilllllllllll 
Db 841 AGTGCATCTTCTGAGGTCAAT 861 

RESULT 6 
US-10-678-355-5 

Sequence 5, Application US/10678355 
Publication No. US20040142865A1 
GENERAL INFORMATION: 
APPLICANT: Weber, Georg F. 

TITLE OF INVENTION: OSTEOPONTIN-BASED CANCER THERAPIES 
FILE REFERENCE: 00398-148001 
CURRENT APPLICATION NUMBER: US/10/678 , 355 
CURRENT FILING DATE: 2003-10-02 
PRIOR APPLICATION NUMBER: US 60/415,712 
PRIOR FILING DATE: 2002-10-02 
NUMBER OF SEQ ID NOS : 9 

SOFTWARE: FastSEQ for Windows Version 4.0 

SEQ ID NO 5 
LENGTH: 861 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-678-355-5 

Query Match 100.0%; Score 861; DB 18; Length 861; 

Best Local similarity 100.0%; Pred. No. 4.6e-255; 

Matches 861; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCC^ 60 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 Ml I MM MM I 

Db 1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

Qy 61 CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCT 120 

^ IIIIIIIIIIIIIIMMMMMIMMMMIMMMMMMMMMMIMIM 

Db 61 CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 

Qy 121 GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGG^^ 180 

IIIIIIIIIIIIIIIIIIMMMMMMMMMMMMMMMMMMMMM 

Db 121 GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

Qy 181 ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAA 240 

^ i 1 1 1 1 1 1 1 1 1 1 1 1 M M M M M M M M M M M M M M I M M M M M M M M M 

Db 181 ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 240 

Qy 241 TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 

^ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M I M M I M M I M M M M M M M M M M M M M I 

Db 241 TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 

OV 301 GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 360 

IIIIIIIIIIMIIIIIIMMIIMIIIIMIMMIMIMIIIMMIIMMMII 

Db 301 GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 360 

OV 361 CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

1 1 1 1 1 1 M M M M M M M M M M M M M M M M M M M M M M M M M M M 
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us-10-678-355-5.rnpb 
CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

IMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

AGGTCAAAATCTAAGAAGTTTCGGAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 5 40 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 5 40 

GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 

AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 

IIIIIMIIIMIilllllllMIIMIIIIIIIIIIIIIillllllllllllllMIII 

AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 
GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 840 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII.IIM 

GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 840 
AGTGCATCTTCTGAGGTCAAT 861 

I III 1 11 III II I Mill III 

AGTGCATCTTCTGAGGTCAAT 861 

RESULT 7 

US-10-775-920-217 

sequence 217, Application US/10775920 
Publication No. US20040175744A1 
GENERAL INFORMATION: 
APPLICANT: Mergen Ltd 

TITLE OF INVENTION: POLYNUCLEOTIDE SEQUENCES AND CORRESPONDING ENCODED 
POLYPEPTIDES 

TITLE OF INVENTION: OF PARTICULAR SECRETED AND MEMBRANE-BOUND PROTEINS 
OVEREXPRESSED 

TITLE OF INVENTION: IN CERTAIN CANCERS 
FILE REFERENCE: Mergen - OOlOB 
CURRENT APPLICATION NUMBER: US/10/775 , 920 
CURRENT FILING DATE: 2004-02-10 
PRIOR APPLICATION NUMBER: US 60/447,900 
PRIOR FILING DATE: 2003-02-13 
NUMBER OF SEQ ID NOS: 385 
software: Patentin version 3.2 

SEQ ID NO 217 

length: 861 
TYPE: dna 

organism: Homo sapiens 
US-10-775-920-217 

Query Match 100.0%; Score 861; DB 18; Length 861; 

Best Local Similarity 100.0%; Pred. No. 4.6e-255; 

Matches 861; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
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Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 



us- 10-6 78-355-5.rnpb 

Qy 1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

llllilllllllMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 

Db 1 ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

Qy 61 CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 

^ IIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIjllMIIIII 

Db 61 CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 

Qy 121 GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

IIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIII 

Db 121 GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

Qy 181 ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 240 

lllllllllllllllllllllllllllllllllllllllllllllllllllillllllll 
Db 181 ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 240 

Qy 241 TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 

lilllllllNIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 241 TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 

Qy 301 GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 360 

Qy 361 CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
Db 361 CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

Qy 421 AGGTC>^^AATCTAAGAAGTTTCGC^ 480 

^. Nllillllllllllllllllllllllllllllllllllllliiiiiiiiiiiiiiiiii 

Db 421 AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

Qy 481 CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 540 

»u ^o. '<<"><>">>>">< XIIIMIIIIIIIIIIIIIIIIIIIIIIIIIllllllliiii 

Db 481 CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGC^TAC^GGCCA^ 540 

Qy 541 GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

»u >llllllllllllllllllllll||||||||||||||||||||||||||||||||||||| 

Db 541 GCCCAGGACCTGAACGCGCCITCTGATTGGGACAGCCGTGGGAAG(UCAGTTA^ 600 

Qy 601 AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAG^ 660 

»U cn-. ■> I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 601 agtcagctggatgaccagagtgctgaagcccacagccacaagcagtccagA™ 660 

Qy 661 cggaaagctaatgatgagagcaatgagcattccgatgtgattgatagtcaggaactttcc 720 

nh «A1 J-iilllJ-U'l'""""" II II I II III I I II III I II I II II I I 

DD 661 cggaaagctaatgatgagagcaatgagcattccgatgtgattgatagtcaggaactttcc 720 

Qy 721 aaagtcagccgtgaattccacagccatgaatttcacagccatgaagatatgctggttgta 780 

nh 7.1 UiUJ-l'""" Illllllllllllllllllllllllllllllllllllllll 

Db 721 aaagtcagccgtgaattccacagccatgaatttcacagccatgaagatatgctggttgta 780 

Qy 781 gaccccaaaagtaaggaagaagataaacacctgaaatttcgtatttctcatgaattagat 840 

nh 7«i iUUIilllUilUiU' I I I II I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I 

Db 781 gaccccaaaagtaaggaagaagataaacacctgaaatttcgtatttctcatgaattagat 840 

Qy 841 agtgcatcttctgaggtcaat 861 

„^ II III III II nil III MM 
Db 841 agtgcatcttctgaggtcaat 861 
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GenCore version 5,1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



May 5, 2005, 02:15:58 ; Search time 3114 Seconds 

(without alignments) 
10524.514 Million cell updates/sec 

US-10-678-355-5 
861 

1 atgagaattgcagtgatttg gtgcatcttctgaggtcaat 861 

IDENTITY_NUC 

Gapop 10.0 / Gapext 1.0 



Searched: 34239544 seqs, 19032134700 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



68479088 



Database 



EST : ■ 



1 


: gb_ 


estl:* 


2 


: gb 


"est2:* 


3 


gb 


]htc: * 


4 


gb 


"est3:* 


5 


gb 


est4 : * 


6 


gb 


"est5:* 


7 


gb_ 


]est6:* 


8 


gb 


gssl : * 


9 


gb 


_gss2 : * 



Fred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No 



% 

Query 



. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


856.2 


99.4 


1349 


3 


CR617515 


CR617515 full-leng 


2 


846.6 


98.3 


1060 


5 


BX379560 


BX379560 BX379560 


3 


826.8 


96.0 


1070 


1 


AL546356 


AL546356 AL546356 


4 


772 


89.7 


1137 


1 


AL542602 


AL542602 AL542602 


5 


772 


89.7 


1341 


3 


CR610632 


CR610632 full-leng 


6 


772 


89.7 


1360 


3 


CR617116 


CR617116 full-leng 


7 


772 


89.7 


1361 


3 


CR615698 


CR615698 full-leng 


8 


772 


89.7 


1372 


3 


CR594254 


CR594254 full-leng 



9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 
41 
42 
43 
44 
45 



772 
772 
772 
772 
772 
772 
772 
772 
772 
772 
772 
772 
772 
772 
772 
771.6 
766.2 
766.2 
766.2 
766.2 
766.2 
766.2 
766.2 
766.2 
766.2 
766.2 
766.2 
766.2 
766.2 
766.2 
766.2 
766.2 
766.2 
766.2 
766.2 
766.2 
766.2 



89.7 
89.7 
89.7 
89.7 
89.7 
89.7 
89.7 



89 

89 
89 
89 
89 
89 
89 
89 
89 
89 
89 
89 
89 
89 
89 
89 
89 
89 
89 
89 
89, 
89, 
89. 
89. 
89, 
89.0 
89.0 
89.0 
89.0 
89.0 



1372 
1383 
1391 
1393 
1398 
1411 
1413 
1415 
1421 
1422 
1422 
1470 
1474 
1502 
1565 
993 
945 
1145 
1214 
1406 
1417 
1419 
1423 
1424 
1426 
1426 
1427 
1428 
1435 
1437 
1439 
1440 
1440 
1442 
1443 
1446 
1448 



3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

5 

9 

5 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 



CR604739 

CR593792 

CR612712 

CR607920 

CR608709 

CR615383 

CR593987 

CR624945 

CR594667 

CR618369 

CR622075 

CR625747 

CR614191 

CR606646 

CR602900 

BX397759 

AY417969 

BX438909 

CR624176 

CR609665 

CR6d3247 

CR590765 

CR596445 

CR605902 

CR591447 

CR613613 

CR596475 

CR591466 

CR618220 

CR601143 

CR599125 

CR610252 

CR626471 

CR600714 

CR616615 

CR600744 

CR619881 



CR604739 

CR593792 

CR612712 

CR607920 

CR608709 

CR615383 

CR593987 

CR624945 

CR594667 

CR618369 

CR622075 

CR625747 

CR614191 

CR606646 

CR602900 

BX397759 

AY417969 

BX438909 

CR624176 

CR609665 

CR603247 

CR590765 

CR596445 

CR605902 

CR591447 

CR613613 

CR596475 

CR591466 

CR618220 

CR601143 

CR599125 

CR610252 

CR626471 

CR600714 

CR616615 

CR600744 

CR619881 



full-leng 

f ull-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

BX397759 

Homo sapi 

BX438909 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 

full-leng 



ALIGNMENTS 



RESULT 1 
CR617515 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 



CR617515 1349 bp mRNA linear HTC 21-JUL-2004 

full-length cDNA clone CS0DI031YL18 of Placenta Cot 25-norniali2ed 
of Homo sapiens (human) . 
CR617515 

CR617515. 1 GI: 50498322 
HTC; CNSLT_cDNA. 
Homo sapiens (human) 

Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 1349) 

Li,W.B., Gruber^C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 



Craniata; Vertebrata; Euteleostomi; 
Catarrhini; Hominidae; Homo. 



JOURNAL 
REMARK 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



FEATURES 

source 



Unpublished 

Contact : Feng Liang Email : fliang@lifetech.com URL : 

http: //fulllength. invitrogen, com/ InVitroGen Corporation 1600 

Faraday Avenue 

2 (bases 1 to 1349) 

Genoscope . 

Direct Submission 

Submitted (20- JUL-2004 ) Genoscope - Centre National de Sequencage : 
BP 191 91006 EVRY cedex - FRANCE (E-mail : seqref@genoscope.cns.fr 
- Web : www.genoscope.cns.fr) 

1st strand cDNA was primed with a Notl-oligo (dT) primer. Five prime 
end enriched, double-strand cDNA was digested with Not I and cloned 
into the Not I and EcoR V sites of the pCMVSPORT 6 vector. Library 
was normalized. Library was constructed by Life Technologies, a 
division of Invitrogen. 

Location/Qualifiers 

1. .1349 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone="CS0DI031YL18" 

/tissue__type="Placenta Cot 25-normalized" 
/plasmid="pCMVSP0RT_6" 



ORIGIN 



Query Match 99.4%; 
Best Local Similarity 99.7%; 
Matches 858; Conservative 



Score 856.2; DB 3; 
Fred. No. 1.8e-235; 
0; Mismatches 3; 



Length 1349; 



Indels 



0; Gaps 



0; 



Qy 

Db 



85 



ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

I I M I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I M I M I I I M 
ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 144 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



61 CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I 
145 CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 204 

121 GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAPAGCCATGACCACATGGATGAT 180 
I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I 

205 GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 264 



181 



240 



ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I 
265 ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 324 

241 TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 
I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I 

325 TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 384 

301 GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 360 

I I I I I I I I I 1 I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 
385 GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 444 



Qy 

Db 



361 CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
445 CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 504 



Qy 421 AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 505 AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 564 

Qy 4 81 CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 540 

I I I I I I I I I I I I I I I I M I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I IN 
Db 565 GACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 624 

Qy 541 GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 625 GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 684 

Qy 601 AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 685 AGTCAGCTGGATGACCAGAGTGCTGAAACCCACAGCCACAAGCAGTCCAGATTATATAAG 744 

Qy 661 CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 745 CGGAAAGCCAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 804 

Qy 721 AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 805 AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 864 

Qy 781 GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 865 GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 924 

Qy 841 AGTGCATCTTCTGAGGTCAAT 861 

I I I I I I I I I I I I I I i I I I I I I 
Db 925 AGTGCATCTTCTGAGGTCAAT 945 



us-10-678-355-5.rnpb 



GenCore version 5.1.6 
copyright (c) 1993 - 2005 Compugen Ltd. 



CM nucleic - nucleic search, using sw model 

Run on: May 5, 2005, 05:07:26 ; Search time 5671 Seconds 

(without alignments) 
928.345 Million cell updates/sec 

Title: US-10-678-355-5 
Perfect score: 861 

Sequence: 1 atgagaattgcagtgatttg gtgcatcttctgaggtcaat 861 

Scoring table: IDENTITY^NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 5654200 seqs, 3057283753 residues 

Total number of hits satisfying chosen parameters: 11308400 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: 



Database 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Publ i shed_Appl i cati ons_NA : 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 



/cgn2_6/ptodata/2/pubpna/US07_PUBC0MB . seq : * 
/cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB . seq : * 
/cgn2_6/ptodata/2/pubpna/US06_NEW_PUB . seq : * 
/cgn2_6/ptodata/2/pubpna/US06_PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/US07_NEW_PUB . seq : * 
/cgn2„6/ptodata/2/pubpna/PCTUS_PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/US08_NEW_PUB . seq : * 
/cgn2_6/ptodata/2/pubpna/us08_PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/us09A^PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/us09_NEW_PUB . seq : * 
/cgn2_6/ptodata/2/pubpna/USl0/UPUBCOMB . seq 
/cgn2_.6/ptodata/2/pubpna/USl0B_PUBCOMB . seq 
/cgn2_6/ptodata/2/pubpna/USl0C_PUBCOMB . seq 
/cgn2_6/ptodata/2/pubpna/USl0D_PUBCOMB . seq 
/cgn2_6/ptodata/2/pubpna/USl0E_PUBCOMB . seq 
/cgn2_6/ptodata/2/pubpna/uslOF_PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/USl0_NEW_PUB . seq : * 
/cgn2_6/ptodata/2/pubpna/USll_NEW.PUB . seq : * 
/cgn2_6/ptodata/2/pubpna/US60«NEW_PUB . seq : * 
/cgn2_6/ptodata/2/pubpna/US60_PUBCOMB . seq : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



% 

Result Query 

No. score Match Length DB ID 
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Description 

Sequence 230, App 



861 100.0 861 14 US-10-097-340-230 

Page 1 



2 


861 


100.0 


861 


14 


US 


3 


861 


100.0 


861 


15 


us 


4 


861 


100.0 


861 


15 


us 


5 


861 


100.0 


861 


15 


us 


6 


861 


100.0 


861 


18 


us 


7 


861 


100.0 


861 


18 


us 


8 


778.4 


90.4 


900 


18 


us 


9 


778.4 


90.4 


900 


18 


us 


10 


772 


89.7 


1447 


17 


us 


11 


772 


89.7 


1447 


18 


us 


12 


772 


89.7 


1447 


18 


us 


13 


772 


89.7 


1447 


18 


us 


14 


772 


89.7 


1469 


9 


US- 


15 


772 


89.7 


1469 


19 


US 


16 


772 


89.7 


1486 


18 


us 


17 


772 


89.7 


1520 


18 


us 


18 


772 


89.7 


1524 


14 


US 


19 


772 


89.7 


1524 


14 


us 


20 


772 


89.7 


1524 


15 


us 


21 


772 


89.7 


1524 


15 


us 


22 


772 


89.7 


1524 


15 


US 


23 


772 


89.7 


1524 


16 


us 


24 


772 


89.7 


1524 


17 


us 


25 


772 


89.7 


1524 


17 


us 


26 


772 


89.7 


1524 


17 


us 


27 


772 


89.7 


1524 


17 


us 


28 


772 


89.7 


1524 


17 


us 


29 


772 


89.7 


1524 


17 


us 


30 


772 


89.7 


1524 


17 


us 


31 


772 


89.7 


1524 


18 


us 


32 


772 


89.7 


1524 


18 


us 


33 


772 


89.7 


1524 


18 


us 


34 


772 


89.7 


1524 


18 


us 


35 


772 


89.7 


1524 


18 


us 


36 


772 


89.7 


1524 


19 


us 


37 


772 


89.7 


1524 


19 


us 


38 


772 


89.7 


1524 


19 


us 


39 


772 


89.7 


1549 


18 


us 


40 


772 


89.7 


1734 


18 


us 


41 


772 


89.7 


2209 


14 


us 


42 


771 


89.5 


942 


14 


us 


43 


771 


89.5 


942 


14 


us 


44 


771 


89.5 


942 


15 


us 


45 


771 


89.5 


942 


15 


us 



us-10-678-355-5 
10-171-311 



-355 



rnpb 
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-306 
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-221 
-209 
1 

-9 

-210 
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Sequence 
Sequence 
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Sequence 
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Sequence 
Sequence 
Sequence 
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175, App 
306, App 
339, App 
146, App 

5, Appli 

217, App 
3, Appli 

218, App 
1270, Ap 
188, App 
221, App 

209, App 
1, Appli 

9, Appli 

210, App 

207, App 
228, App 
173, App 
304, App 
337, App 
144 , App 
70, Appl 
268, App 
507, App 
1116, Ap 
11, Appl 
392, App 

6, Appli 
6, Appli 
279, App 

6, Appli 

7, Appli 

8, Appli 

208, App 
282, App 
796, App 
1, Appli 
215, App 
7782, Ap 
13927, A 
226, App 
171, App 
302, App 
335, App 



RESULT 4 

US-10-177-293-339 

Sequence 339, Application US/10177293 
Publication No. US20030124128A1 
GENERAL INFORMATION: 
APPLICANT: Li 1 lie, James 
APPLICANT: Glatt, Karen 
APPLICANT: Zhao, xumei 
APPLICANT: Gannavarpu, Manjula 
APPLICANT: Kamatkar, shubhangi 
Mertens, Maureen 
Myer, Vic 
Wang, Youzhen 
APPLICANT: Xu, Yongyao 
APPLICANT: Hoersch, Sebastian 
APPLICANT: Monahan, John 



APPLICANT: 
APPLICANT: 
APPLICANT: 
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us-10-678-355-5.rnpb 

APPLICANT: Meyers, Rachel E. 
APPLICANT: Bast Jr., Robert C. 
APPLICANT: Hortobagyi, Gabriel N. 
APPLICANT: Pusztai , Lajos 
APPLICANT: Meric, Funda 
APPLICANT: Sahin, Aysegul 
APPLICANT: Mills, Gordon B. 

TITLE OF INVENTION: COMPOSITIONS, KITS, AND METHODS FOR IDENTIFICATION, 
ASSESSMENT, 

TITLE OF INVENTION: PREVENTION, AND THERAPY OF BREAST CANCER 
FILE REFERENCE: MRI-038 

CURRENT APPLICATION NUMBER: US/10/177 , 293 
CURRENT FILING DATE: 2002-06-21 
PRIOR APPLICATION NUMBER: US 60/299,887 
PRIOR FILING DATE: 2001-06-21 
PRIOR APPLICATION NUMBER: US 60/301,572 
PRIOR FILING DATE: 2001-06-27 
PRIOR APPLICATION NUMBER: US 60/306,501 
PRIOR FILING DATE: 2001-07-18 
PRIOR APPLICATION NUMBER: US 60/325,002 
PRIOR FILING DATE: 2001-09-25 
PRIOR APPLICATION NUMBER: US 60/362,585 
PRIOR FILING DATE: 2002-03-05 
PRIOR APPLICATION NUMBER: US 60/xXX,XXX 
PRIOR FILING DATE: 2002-05-14 
NUMBER OF SEQ ID NOS : 506 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 339 
LENGTH: 861 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-177-293-339 

Query Match 100.0%; Score 861; DB 15; Length 861; 

Best Local Similarity 100.0%; Pred. No. 4.6e-255; 

Matches 861; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 12 0 

llllllllllllllllllllllllllllllllllllllllilllllllllllllllilll 
CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 

GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

illlllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 2 40 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 2 40 

TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 

TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 
GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 3 60 

IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 360 

CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 42 0 
llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
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61 
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121 
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181 
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301 
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361 



us-10-678-355-5.rnpb 
Db 361 CCAGTTGTCCCCACAGTAGACACATATGATGGCCGAGGTGATAGTGTGGTTTATGGACTG 420 

QV 421 AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMII 

Db 421 AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

QV 481 CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 540 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 481 CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 540 

Qy 541 GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIillllillllllll 
Db 541 GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

QV 601 AGTCAGCTGGATGACCAGAGTGCTGAA6CCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

lllllllllllllllllllllllllllllllllllllillllllllilllllllillill 
Db 601 AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

QV 661 CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 

llllllllllllllllllllllllilllllllllllllllllllllllllllllllllll 
Db 661 CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 

Qy 721 AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 

lllllllllllllllllllilllllllllillllllllllllllllllllilllllllll 
Db 721 AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 

Qy 781 GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 840 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIII 

Db 781 GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 840 

Qy 841 AGTGCATCTTCTGAGGTCAAT 861 

lllllilllllllllllllll 
Db 841 AGTGCATCTTCTGAGGTCAAT 861 
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us-10-678-355-5.rnpb 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: May 5, 2005, 05:07:26 ; Search time 5671 Seconds 

(without alignments) 
928.345 Million cell updates/sec 

Title: US-10-678-355-5 
Perfect score: 861 

sequence: 1 atgagaattgcagtgatttg gtgcatcttctgaggtcaat 861 

scoring table: lDENTlTy_NUC 

Gapop 10.0 , Gapext 1.0 

searched: 5654200 seqs, 3057283753 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



11308400 



Database 



Publ i shed_Appl i cati ons_NA: 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 



/cgn2_6/ptodata/2/pubpna/US07_PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB . seq : * 
/cgn2_6/ptodata/2/pubpna/US06_NEW_PUB . seq : * 
/cgn2_6/ptodata/2/pubpna/us06_PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/us07_NEW_PUB . seq : * 
/cgn2_6/ptodata/2/pubpna/PCTUS„PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/US08_NEW_PUB . seq : * 
/cgn2_6/ptodata/2/pubpna/US08_PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/US09A^PUBCOMB . seq : * 
/cgn2„6/ptodata/2/pubpna/US09B_PUBCOMB . seq : * 
/cgn2.6/ptodata/2/pubpna/US09C_PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/US09_NEW_PUB . seq : * 
/cgn2_6/ptodata/2/pubpna/USl0A_PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/USl0B_PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/USl0C_PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/USl0D_PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/USlOE_PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/USlOF_PUBCOMB . seq : * 
/cgn2_6/ptodata/2/pubpna/USlO_NEW_PUB . seq : * 
/cgn2_6/ptodata/2/pubpna/uslLNEW«PUB . seq : * 
/cgn2_6/ptodata/2/pubpna/us60_NEW_PUB . seq : * 
/cgn2_6/ptodata/2/pubpna/US60_PUBCOMB . seq : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



% 

Result Query 

No. score Match Length DB ID 



SUMMARIES 



Description 

sequence 230, App 
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175, App 
306, App 
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146, App 

5, Appli 

217, App 
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218, App 
1270, Ap 
188, App 
221, App 
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228, App 
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ALIGNMENTS 



RESULT 1 

US-10-097-340-230 

Sequence 230, Application US/10097340 
Publication No. US20030087250A1 
GENERAL INFORMATION: 
APPLICANT: John MONAHAN 

APPLICANT: Manjula gannavarapu 

APPLICANT: Sebastian HOERSCH 

APPLICANT: Shubhangi KAMATKAR 

APPLICANT: Steve G. KOVATS 

APPLICANT: Rachel E. MEYERS 

APPLICANT: Michael MORRISEY 

APPLICANT: Peter OLANDT 

APPLICANT: Ami SEN 
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us-10-678-355-5.rnpb 

APPLICANT: Peter VEIBY 
APPLICANT: Gordon B. MILLS 
APPLICANT: Robert C, BAST, Jr. 
APPLICANT: Karen LU 

APPLICANT: Rosemarie SCHMANDT . . 

APPLICANT: xumei ZHAO 
APPLICANT: Karen GLATT 

TITLE OF INVENTION: Nuclei c Acid Molecules and Proteins For The Identificati 
TITLE OF INVENTION: Assessment, Prevention, and Therapy of Ovarian Cancer 
FILE REFERENCE: MRI-030 

CURRENT APPLICATION NUMBER: US/10/097 , 340 
CURRENT FILING DATE: 2002-03-14 
PRIOR APPLICATION NUMBER: 60/276,025 
PRIOR FILING DATE: 2001-03-14 
PRIOR APPLICATION NUMBER: 60/325,149 

PRIOR FILING DATE: 2001-09-26 ^ , . 

PRIOR APPLICATION NUMBER: 60/276,026 ^ OId^ , \ V 

PRIOR FILING DATE: 2001-03-14 *^ 
PRIOR APPLICATION NUMBER: 60/324,967 
PRIOR FILING DATE: 2001/09/26 
PRIOR APPLICATION NUMBER: 60/311,732 
PRIOR FILING DATE: 2001-08-10 
PRIOR APPLICATION NUMBER: 60/325,102 
PRIOR FILING DATE: 2001-09-26 
PRIOR APPLICATION NUMBER: 60/323,580 
PRIOR FILING DATE: 2001-09-19 
NUMBER OF SEQ ID NOS: 363 

SOFTWARE: FastSEQ for Windows version 4.0 
SEQ ID NO 230 
LENGTH: 861 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-097-340-230 

Query Match 100.0%; Score 861; DB 14; Length 861; 

Best Local similarity 100.0%; Pred. No. 4.6e-255; 

Matches 861; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
ATGAGAATTGCAGTGATTTGCTTTTGCCTCCTAGGCATCACCTGTGCCATACCAGTTAAA 60 

CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 

IIIIIMIIIIIillllllllllllllllllMllllllllllllllllllllllillll 

CAGGCTGATTCTGGAAGTTCTGAGGAAAAGCAGAATGCTGTGTCCTCTGAAGAAACCAAT 120 
GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

llllllllllllllllllllllllllllllllllllllllllllllllilllllMIIII 

GACTTTAAACAAGAGACCCTTCCAAGTAAGTCCAACGAAAGCCATGACCACATGGATGAT 180 

ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 2 40 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
ATGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTGACTCGAACGAC 2 40 

TCTG ATGATGTAG ATG ACACTG ATG ATTCTCACC AGTCTGATGAGTCTCACCATTCTGAT 3 00 

lllllllllllllllllllllllllllllllillllllllllllllllllllllllllll 
TCTGATGATGTAGATGACACTGATGATTCTCACCAGTCTGATGAGTCTCACCATTCTGAT 300 

GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
GAATCTGATGAACTGGTCACTGATTTTCCCACGGACCTGCCAGCAACCGAAGTTTTCACT 360 
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661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 



us-10-678-355-5.rnpb 
illlllllllllllllllllllllllllllllllllllllllllllllllllllllllll 



420 



AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

llllllllllllllllllllllllllllllllllllllllllllllllllllllll nil 

AGGTCAAAATCTAAGAAGTTTCGCAGACCTGACATCCAGTACCCTGATGCTACAGACGAG 480 

CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 5 40 

llllilllllllllllllllllllllllllllllllllllllllllllllllllllllll 
CACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTT 5 40 

GCCCA6GACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 
llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
GCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACG 600 

AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIII 

AGTCAGCTGGATGACCAGAGTGCTGAAGCCCACAGCCACAAGCAGTCCAGATTATATAAG 660 

CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 
llllllllllllllllllllllllllllllllllllllllllllllllllilllllllll 
CGGAAAGCTAATGATGAGAGCAATGAGCATTCCGATGTGATTGATAGTCAGGAACTTTCC 720 

AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 
lllllllllllllllllllllllllllllillllllllllllllllllllllllllllll 
AAAGTCAGCCGTGAATTCCACAGCCATGAATTTCACAGCCATGAAGATATGCTGGTTGTA 780 

GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 840 
lllllllllllllllllllllllllllillllllllllllllllllllllllllllllll 
GACCCCAAAAGTAAGGAAGAAGATAAACACCTGAAATTTCGTATTTCTCATGAATTAGAT 840 

AGTGCATCTTCTGAGGTCAAT 861 

IIIIIIIIIIIIIIIIIIIM 

AGTGCATCTTCTGAGGTCAAT 861 
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